Search CORE

77 research outputs found

ORENZA: a web resource for studying ORphan ENZyme activities

Author: A Bairoch
A Bairoch
A Fleischmann
Bernard Labedan
DG Naumoff
DL Wheeler
HM Berman
I Schomburg
M Kanehisa
ML Green
N Hulo
O Lespinet
O Lespinet
Olivier Lespinet
PD Karp
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Despite the current availability of several hundreds of thousands of amino acid sequences, more than 36% of the enzyme activities (EC numbers) defined by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) are not associated with any amino acid sequence in major public databases. This wide gap separating knowledge of biochemical function and sequence information is found for nearly all classes of enzymes. Thus, there is an urgent need to explore these sequence-less EC numbers, in order to progressively close this gap. DESCRIPTION: We designed ORENZA, a PostgreSQL database of ORphan ENZyme Activities, to collate information about the EC numbers defined by the NC-IUBMB with specific emphasis on orphan enzyme activities. Complete lists of all EC numbers and of orphan EC numbers are available and will be periodically updated. ORENZA allows one to browse the complete list of EC numbers or the subset associated with orphan enzymes or to query a specific EC number, an enzyme name or a species name for those interested in particular organisms. It is possible to search ORENZA for the different biochemical properties of the defined enzymes, the metabolic pathways in which they participate, the taxonomic data of the organisms whose genomes encode them, and many other features. The association of an enzyme activity with an amino acid sequence is clearly underlined, making it easy to identify at once the orphan enzyme activities. Interactive publishing of suggestions by the community would provide expert evidence for re-annotation of orphan EC numbers in public databases. CONCLUSION: ORENZA is a Web resource designed to progressively bridge the unwanted gap between function (enzyme activities) and sequence (dataset present in public databases). ORENZA should increase interactions between communities of biochemists and of genomicists. This is expected to reduce the number of orphan enzyme activities by allocating gene sequences to the relevant enzymes

Crossref

Springer - Publisher Connector

PubMed Central

A survey of orphan enzyme activities

Author: A Bairoch
A Bairoch
A Barrett
A Barrett
B Briggs
D Naumoff
DL Wheeler
E Pennisi
J Melnick
JD Peterson
K Tipton
MY Galperin
O Lespinet
O Lespinet
O Lespinet
O Lespinet
P Bork
P Karp
P Romero
PD Karp
Peter D Karp
RJ Roberts
RJ Roberts
RV Misra
T Cheng
TJ Lee
W Nishii
X Chen
Y Wang
Yannick Pouliot
Publication venue: BioMed Central
Publication date: 01/07/2007
Field of study

Abstract Background Using computational database searches, we have demonstrated previously that no gene sequences could be found for at least 36% of enzyme activities that have been assigned an Enzyme Commission number. Here we present a follow-up literature-based survey involving a statistically significant sample of such "orphan" activities. The survey was intended to determine whether sequences for these enzyme activities are truly unknown, or whether these sequences are absent from the public sequence databases but can be found in the literature. Results We demonstrate that for ~80% of sampled orphans, the absence of sequence data is bona fide. Our analyses further substantiate the notion that many of these enzyme activities play biologically important roles. Conclusion This survey points toward significant scientific cost of having such a large fraction of characterized enzyme activities disconnected from sequence data. It also suggests that a larger effort, beginning with a comprehensive survey of all putative orphan activities, would resolve nearly 300 artifactual orphans and reconnect a wealth of enzyme research with modern genomics. For these reasons, we propose that a systematic effort to identify the cognate genes of orphan enzymes be undertaken.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Survival trade-offs in plant roots during colonization by closely related beneficial and pathogenic fungi

Author: Dallery J.
Damm U.
Garrido Oter R.
Hacquard S.
Hainaut M.
Henrissat B.
Hiruma K.
Kemen E.
Kracher B.
Lespinet O.
McHardy A.
Muech P.
O'Connell R.
Sacristan S.
Schulze-Lefert P.
Thon M.
Ver Loren van Themaat E.
Weimann A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2016
Field of study

MPG.PuRe

Identification and Gene Expression Analysis of a Taxonomically Restricted Cysteine-Rich Protein Family in Reef-Building Corals

Author: AI Saeed
Alejandro Reyes-Bermudez
BC Schutte
C Notredame
Christian R. Voolstra
DJ Miller
DJ Miller
E Gasteiger
FA Kondrashov
G Jobb
H Wu
I Fukuda
JA Schwarz
K Nakayama
KA Brogden
LC Grasso
M Kimura
M Lynch
M Medina
M Rholam
M Schena
MA Nowak
MB Eisen
Michael K. DeSalvo
MK DeSalvo
Mónica Medina
NH Putnam
O Emanuelsson
O Lespinet
R Chenna
RD Kortschak
RD Page
RE Steele
Robert DeSalle
S Ohno
S Rozen
SF Altschul
Shinichi Sunagawa
SL Romano
T Watanabe
TA Hall
U Technau
X Cui
X Huang
Publication venue: Public Library of Science
Publication date: 13/03/2009
Field of study

The amount of genomic sequence information continues to grow at an exponential rate, while the identification and characterization of genes without known homologs remains a major challenge. For non-model organisms with limited resources for manipulative studies, high-throughput transcriptomic data combined with bioinformatics methods provide a powerful approach to obtain initial insights into the function of unknown genes. In this study, we report the identification and characterization of a novel family of putatively secreted, small, cysteine-rich proteins herein named Small Cysteine-Rich Proteins (SCRiPs). Their discovery in expressed sequence tag (EST) libraries from the coral Montastraea faveolata required the performance of an iterative search strategy based on BLAST and Hidden-Markov-Model algorithms. While a discernible homolog could neither be identified in the genome of the sea anemone Nematostella vectensis, nor in a large EST dataset from the symbiotic sea anemone Aiptasia pallida, we identified SCRiP sequences in multiple scleractinian coral species. Therefore, we postulate that this gene family is an example of lineage-specific gene expansion in reef-building corals. Previously published gene expression microarray data suggest that a sub-group of SCRiPs is highly responsive to thermal stress. Furthermore, data from microarray experiments investigating developmental gene expression in the coral Acropora millepora suggest that different SCRiPs may play distinct roles in the development of corals. The function of these proteins remains to be elucidated, but our results from in silico, transcriptomic, and phylogenetic analyses provide initial insights into the evolution of SCRiPs, a novel, taxonomically restricted gene family that may be responsible for a lineage-specific trait in scleractinian corals

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

An Atlas of the Speed of Copy Number Changes in Animal Gene Families and Its Implications

Author: AV Furano
C Vogel
D Pan
DA Petrov
DA Petrov
Deng Pan
DM Krylov
E Birney
EG Danchin
F Raible
H Frohlich
H Roest Crollius
I Wapinski
J Felsenstein
J Jiang
JA Graves
JK Killian
JP Demuth
JZ Zhang
L Aravind
Liqing Zhang
M Karuppasamy
M Lynch
MJ Wakefield
MW Hahn
N Lopez-Bigas
N Saitou
O Lespinet
P Pavlidis
Pawel Michalak
PM Harrison
RD Kortschak
SB Hedges
T Blomme
TJ Sargeant
Y Bai
Publication venue: Public Library of Science
Publication date: 23/10/2009
Field of study

The notion that gene duplications generating new genes and functions is commonly accepted in evolutionary biology. However, this assumption is more speculative from theory rather than well proven in genome-wide studies. Here, we generated an atlas of the rate of copy number changes (CNCs) in all the gene families of ten animal genomes. We grouped the gene families with similar CNC dynamics into rate pattern groups (RPGs) and annotated their function using a novel bottom-up approach. By comparing CNC rate patterns, we showed that most of the species-specific CNC rates groups are formed by gene duplication rather than gene loss, and most of the changes in rates of CNCs may be the result of adaptive evolution. We also found that the functions of many RPGs match their biological significance well. Our work confirmed the role of gene duplication in generating novel phenotypes, and the results can serve as a guide for researchers to connect the phenotypic features to certain gene duplications

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Evolution of multiple phosphodiesterase isoforms in stickleback involved in cAMP signal transduction pathway

Author: A Force
A Hansen
BS Stevenson
C Yan
C Yan
D Posada
E Birney
F Ronquist
G Blanc
G Jobb
G Jobb
G Koh
GD Tocchini-Valentini
GL Card
H Escriva
H Matsuno
J Zhang
JA Birchler
JD Thompson
JH Thomas
JL Hurst
JP Bielawski
K Strimmer
K Tamura
K Tsunemoto
L Duret
M Freeling
M Kakita
M Kanehisa
M Kasahara
M Nagasaki
M Robinson-Rechavi
M Sémon
ME Mangan
Mutsumi Nishida
N Hulo
O Jaillon
O Lespinet
P Flicek
R Kawahara
RD Finn
RJ Wootton
S Maere
S Ohno
T Aoki
T Hughes
V Douard
WP Maddison
Y Niimura
Y Sato
Y Sato
Yasuyuki Hashiguchi
YH Jeon
Yukuto Sato
Z Yang
Z Yang
Z Yang
Publication venue: BioMed Central
Publication date: 20/02/2009
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Adaptive Evolution in Zinc Finger Transcription Factors

Author: A Bateman
AJ Williams
AT Hamilton
B Gebelein
C Looman
C Underhill
CA Kim
CJ Krebs
CT Workman
D Lugtenberg
D Schmidt
DC Schultz
DC Schultz
DS Wuttke
EJ Bellefroid
EJ Bellefroid
ES Lander
GE Crooks
GN Filipova
GN Filipova
H Peng
James H. Thomas
JC Venter
JD Thompson
JF Margolin
JR Friedman
K Ayyanathan
K Ishihara
KD Pruitt
L Medugno
LC Edelstein
M Anisimova
M Elrod-Erickson
M Horiba
M Shannon
M Wiznerowicz
MJ Garcia-Garcia
MP Foster
O Albagli
O Lespinet
P Dehal
R Urrutia
R Witzgall
RF Ryan
Ryan O. Emerson
S Huntley
S Iuchi
SA Sawyer
SA Shoichet
SB Cannon
SB Carroll
Simon Myers
SP Sripathy
SR Eddy
T Kleefstra
TL Sander
VJ Bardwell
WSW Wong
Y Agata
Y Choo
Z Birtle
Z Yang
Z Yang
ZX Wang
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

The majority of human genes are conserved among mammals, but some gene families have undergone extensive expansion in particular lineages. Here, we present an evolutionary analysis of one such gene family, the poly–zinc-finger (poly-ZF) genes. The human genome encodes approximately 700 members of the poly-ZF family of putative transcriptional repressors, many of which have associated KRAB, SCAN, or BTB domains. Analysis of the gene family across the tree of life indicates that the gene family arose from a small ancestral group of eukaryotic zinc-finger transcription factors through many repeated gene duplications accompanied by functional divergence. The ancestral gene family has probably expanded independently in several lineages, including mammals and some fishes. Investigation of adaptive evolution among recent paralogs using dN/dS analysis indicates that a major component of the selective pressure acting on these genes has been positive selection to change their DNA-binding specificity. These results suggest that the poly-ZF genes are a major source of new transcriptional repression activity in humans and other primates

Crossref

Directory of Open Access Journals

PubMed Central

Analysis of Combinatorial Regulation: Scaling of Partnerships between Regulators with the Number of Governed Targets

Author: A Martinez-Antonio
A Wagner
AJ Walhout
Alexej Abyzov
Andrey Rzhetsky
C Jiang
CE Horak
CT Harbison
E Davidson
E Ravasz
GC Conant
H Jeong
H Jeong
HC Mak
Hui Lu
J Ihmels
J Ptacek
JD Han
Koon-Kiu Yan
L Adamic
M Isalan
M Kloster
Mark B. Gerstein
Matthew B. Carson
MM Babu
N Bluthgen
N Guelzim
Nitin Bhardwaj
NM Luscombe
O Lespinet
R Albert
R Guimera
R Hermsen
R Milo
RIM Dunbar
RM Coulson
S Balaji
S Balaji
S Ciliberti
S Gama-Castro
S Maslov
SA Teichmann
SS Shen-Orr
TI Lee
TS Prasad
VV Svetlov
W Wang
Y Pilpel
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Through combinatorial regulation, regulators partner with each other to control common targets and this allows a small number of regulators to govern many targets. One interesting question is that given this combinatorial regulation, how does the number of regulators scale with the number of targets? Here, we address this question by building and analyzing co-regulation (co-transcription and co-phosphorylation) networks that describe partnerships between regulators controlling common genes. We carry out analyses across five diverse species: Escherichia coli to human. These reveal many properties of partnership networks, such as the absence of a classical power-law degree distribution despite the existence of nodes with many partners. We also find that the number of co-regulatory partnerships follows an exponential saturation curve in relation to the number of targets. (For E. coli and Bacillus subtilis, only the beginning linear part of this curve is evident due to arrangement of genes into operons.) To gain intuition into the saturation process, we relate the biological regulation to more commonplace social contexts where a small number of individuals can form an intricate web of connections on the internet. Indeed, we find that the size of partnership networks saturates even as the complexity of their output increases. We also present a variety of models to account for the saturation phenomenon. In particular, we develop a simple analytical model to show how new partnerships are acquired with an increasing number of target genes; with certain assumptions, it reproduces the observed saturation. Then, we build a more general simulation of network growth and find agreement with a wide range of real networks. Finally, we perform various down-sampling calculations on the observed data to illustrate the robustness of our conclusions

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Comparative genomic analysis reveals independent expansion of a lineage-specific gene family in vertebrates: The class II cytokine receptors and their ligands in mammals and fish

Author: A Agrawal
AL Hughes
AL Hughes
AM Fahrer
C Gaboriaud
C Ghiglione
D Danino
E Meurs
E Thoreau
G Lutfalla
G Uzé
HW Chen
I.K. A Makarova, K.S., A Spouge, J.L., A Wolf, Y.I., A Koonin, E.V. Jordan
J Calduch-Giner
J Reboul
JF Bazan
JF Bazan
KE Mogensen
L Du Pasquier
L Du Pasquier
M Hattori
M Lewerenz
MR Walter
MR Walter
N Galtier
N Mackman
NA Nicola
O Lespinet
P Dehal
P Sheppard
PM Murphy
R Kuhn
RM Roberts
S Aparicio
S Brown
SD Spencer
SM Altmann
SV Kotenko
SV Kotenko
T Wang
WH Yap
Y Nakagawa
Y Taya
Publication venue: BioMed Central
Publication date: 01/07/2003
Field of study

BACKGROUND: The high degree of sequence conservation between coding regions in fish and mammals can be exploited to identify genes in mammalian genomes by comparison with the sequence of similar genes in fish. Conversely, experimentally characterized mammalian genes may be used to annotate fish genomes. However, gene families that escape this principle include the rapidly diverging cytokines that regulate the immune system, and their receptors. A classic example is the class II helical cytokines (HCII) including type I, type II and lambda interferons, IL10 related cytokines (IL10, IL19, IL20, IL22, IL24 and IL26) and their receptors (HCRII). Despite the report of a near complete pufferfish (Takifugu rubripes) genome sequence, these genes remain undescribed in fish. RESULTS: We have used an original strategy based both on conserved amino acid sequence and gene structure to identify HCII and HCRII in the genome of another pufferfish, Tetraodon nigroviridis that is amenable to laboratory experiments. The 15 genes that were identified are highly divergent and include a single interferon molecule, three IL10 related cytokines and their potential receptors together with two Tissue Factor (TF). Some of these genes form tandem clusters on the Tetraodon genome. Their expression pattern was determined in different tissues. Most importantly, Tetraodon interferon was identified and we show that the recombinant protein can induce antiviral MX gene expression in Tetraodon primary kidney cells. Similar results were obtained in Zebrafish which has 7 MX genes. CONCLUSION: We propose a scheme for the evolution of HCII and their receptors during the radiation of bony vertebrates and suggest that the diversification that played an important role in the fine-tuning of the ancestral mechanism for host defense against infections probably followed different pathways in amniotes and fish

DSpace@MIT

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Domain architecture evolution of pattern-recognition receptors

In animals, the innate immune system is the first line of defense against invading microorganisms, and the pattern-recognition receptors (PRRs) are the key components of this system, detecting microbial invasion and initiating innate immune defenses. Two families of PRRs, the intracellular NOD-like receptors (NLRs) and the transmembrane Toll-like receptors (TLRs), are of particular interest because of their roles in a number of diseases. Understanding the evolutionary history of these families and their pattern of evolutionary changes may lead to new insights into the functioning of this critical system. We found that the evolution of both NLR and TLR families included massive species-specific expansions and domain shuffling in various lineages, which resulted in the same domain architectures evolving independently within different lineages in a process that fits the definition of parallel evolution. This observation illustrates both the dynamics of the innate immune system and the effects of “combinatorially constrained” evolution, where existence of the limited numbers of functionally relevant domains constrains the choices of domain architectures for new members in the family, resulting in the emergence of independently evolved proteins with identical domain architectures, often mistaken for orthologs

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California